Microsecond-scale Preemption for Concurrent GPU-accelerated DNN Inferences | USENIX
Transparent GPU Sharing in Container Clouds for Deep Learning Workloads | USENIX
Using CUDA IPC memory handles in pytorch - PyTorch Forums
This thread contains an IPC approach (see the sketch below).
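A minimal sketch of the kind of IPC scheme the note refers to, assuming the standard PyTorch path: passing a CUDA tensor through torch.multiprocessing, which ships only a CUDA IPC memory handle (cudaIpcGetMemHandle / cudaIpcOpenMemHandle) rather than copying the data. This is not the forum thread's exact code, just an illustration of the mechanism.

```python
# Sketch: zero-copy sharing of a CUDA tensor between processes via CUDA IPC.
# torch.multiprocessing's reducer serializes only the IPC handle; the child
# process maps the same device allocation.
import torch
import torch.multiprocessing as mp


def consumer(queue):
    shared = queue.get()   # receives an IPC handle, maps the producer's memory
    shared += 1            # in-place update is visible to the producer


if __name__ == "__main__":
    mp.set_start_method("spawn")        # required when sending CUDA tensors
    queue = mp.Queue()
    t = torch.zeros(4, device="cuda")

    p = mp.Process(target=consumer, args=(queue,))
    p.start()
    queue.put(t)                        # producer must keep t alive while the consumer uses it
    p.join()
    print(t)                            # reflects the consumer's in-place update
```

Caveat: the producer has to keep the tensor alive for as long as any consumer maps it, and cross-process synchronization is the caller's responsibility; here p.join() provides the ordering.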